Picture for Rui Min

Rui Min

Scalable Token-Level Hallucination Detection in Large Language Models

Add code
May 12, 2026
Viaarxiv icon

RemoteZero: Geospatial Reasoning with Zero Human Annotations

Add code
May 06, 2026
Viaarxiv icon

RemoteAgent: Bridging Vague Human Intents and Earth Observation with RL-based Agentic MLLMs

Add code
Apr 09, 2026
Viaarxiv icon

GeoBrowse: A Geolocation Benchmark for Agentic Tool Use with Expert-Annotated Reasoning Traces

Add code
Apr 05, 2026
Viaarxiv icon

Mitigating Safety Tax via Distribution-Grounded Refinement in Large Reasoning Models

Add code
Feb 02, 2026
Viaarxiv icon

Empowering Reliable Visual-Centric Instruction Following in MLLMs

Add code
Jan 06, 2026
Viaarxiv icon

EcomBench: Towards Holistic Evaluation of Foundation Agents in E-commerce

Add code
Dec 11, 2025
Figure 1 for EcomBench: Towards Holistic Evaluation of Foundation Agents in E-commerce
Figure 2 for EcomBench: Towards Holistic Evaluation of Foundation Agents in E-commerce
Figure 3 for EcomBench: Towards Holistic Evaluation of Foundation Agents in E-commerce
Figure 4 for EcomBench: Towards Holistic Evaluation of Foundation Agents in E-commerce
Viaarxiv icon

Reasoning Path Divergence: A New Metric and Curation Strategy to Unlock LLM Diverse Thinking

Add code
Oct 30, 2025
Viaarxiv icon

WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents

Add code
Sep 16, 2025
Figure 1 for WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents
Figure 2 for WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents
Figure 3 for WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents
Figure 4 for WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents
Viaarxiv icon

Evaluating Large Language Model with Knowledge Oriented Language Specific Simple Question Answering

Add code
May 22, 2025
Figure 1 for Evaluating Large Language Model with Knowledge Oriented Language Specific Simple Question Answering
Figure 2 for Evaluating Large Language Model with Knowledge Oriented Language Specific Simple Question Answering
Figure 3 for Evaluating Large Language Model with Knowledge Oriented Language Specific Simple Question Answering
Figure 4 for Evaluating Large Language Model with Knowledge Oriented Language Specific Simple Question Answering
Viaarxiv icon